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METHOP ANP APPARATUS FOR jREPUCJNG THF COMPUTATJONAI. 
COMPLEXITY AND RELAXING THE C WTIC AL PATS OF REDUCED STATE 
SEQUENCE ESTIMATION (RSSE) TECHNIQUES 

5 Field Qfthe Inyggtjop 

The present invention relates generally to channel equalization and decoding 
techniques, and more particularly, to sequence estimation techniques with reduced complexity. 

Background of the Invention 

10 The transmission rates for local area networks (LANs) that use twisted pair 

conductors have progressively increased from 10 Megabits-per-second (Mbps) to 1 Gigabit-per- 
second (Gbps). The Gigabit Ethernet 1000 Base-T standard, for example, operates at a clock rate 
of 125 MHz and uses four copper pairs to transmit 1 Gbps. Trellis-coded modulation (TCM) is 

o 

^5 employed by the transmitter, in a known manner, to achieve asymptotic coding gains. The 

Ly 

?yl5 signals amving at the receiver are typically corrupted by intersymbol interference (ISI), 
crosstalk, echo, and noise. A major challenge for receivers in such a channel environment is to 

m jointly equaUze the channel and decode the corrupted trellis-coded signals at such high clock 

y 5 

3 rates. As the high processing speed requires a parallel implementation without resource sharing, 
^ managing hardware complexity becomes difficult. Another issue is to meet the speed 
P20 requirements, as the algorithms for joint equahzation and decoding incorporate non-linear 
feedback loops which cannot be pipelined. 

Data detection is often performed using maximum likelihood sequence estimation 
(MLSE), to produce the output symbols or bits. A maximum likelihood sequence estimator 
(MLSE) considers all possible sequences and determines which sequence was actually 
25 transmitted, in a known manner. The maximum likelihood sequence estimator (MLSE) is the 
optimum decoder and applies the well-known Viterbi algorithm to the combined code and 
channel trellis. For a more detailed discussion of a Viterbi implementation of a maximum 
likelihood sequence estimator (MLSE), see Gerhard Fettweis and Heinrich Meyr, "High-Speed 
Parallel Viterbi Decoding Algorithm and VLSI- Architecture," IEEE Communication Magazine 
30 (May 1 99 1 ), incorporated by reference herein. 
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The computation and storage requirements of the Viterbi algorithm are 
proportional to the number of states. The number of states of the combined trellis is given by 
5x2"*^, where S is the number of code states, m is the number of bits for each information 
symbol, and L is the length of the channel memory. For the Gigabit Ethernet standard, for 
5 example, S = 8, /w = 8, and I«10, which leads to a prohibitively expensive Viterbi algorithm 
with about 10^^ states. 

In order to manage the hardware complexity for the maximum likelihood 
sequence estimator (MLSE) that applies the Viterbi algorithm, a number of sub-optimal 
approaches, such as "reduced state sequence estimation (RSSE)" algorithms, have been proposed 
10 or suggested. For a discussion of reduced state sequence estimation (RSSE) techniques, see, for 
example, P. R. Chevillat and E. Eleftheriou, "Decoding of Trellis-Encoded Signals in the 
fj Presence of Intersymbol Interference and Noise", IEEE Trans. Commun., vol. 37, 669-76, (July 
^ 1989) and M. V. Eyuboglu and S. U. H. Qureshi, "Reduced-State Sequence Estimation For 

W Coded Modulation On Intersymbol Interference Channels", IEEE JSAC, vol. 7, 989-95 (Aug. 

y ^ 

\J15 1 989), each incorporated by reference herein. 

rt^ Generally, reduced state sequence estimation (RSSE) techniques reduce the 

^ complexity of the maximum likelihood sequence estimators (MLSE) by merging multiple states 
yl of the full combined channel/code trellis. Although RSSE techniques reduce the number of 
5 states for Viterbi decoding, the required computations are still too complex at the high clock rates 
associated with the Gigabit Ethemet standard, as the high processing speeds require a parallel 
implementation without resource sharing. In addition, the RSSE technique incorporates non- 
linear feedback loops which cannot be pipelined. The critical path associated with these 
feedback loops is the limiting factor for high-speed implementations. Simplifying the RSSE 
technique by further reducing the number of states or by doing separate equalization with a 
25 decision-feedback equalizer (DFE) and decoding of the TCM codes comes often with a 
significant penalty in terms of signal-to-noise ratio (SNR) performance. As apparent from the 
above-described deficiencies with conventional reduced state sequence estimation (RSSE) 
algorithms, a need exists for a reduced state sequence estimation (RSSE) algorithm that reduces 
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the hardware complexity of RSSE techniques for a given number of states and also relaxes the 
critical path problem. 

Summary of the Invention 

Generally, a method and apparatus are disclosed for reducing the complexity of 
the RSSE technique for a given number of states while also relaxing the critical path problem. A 
communications channel is represented using a discrete time model, where the channel impulse 

response has a memory length, L, denoted by [f^ where is the coefficient for channel tap 
k. The signal energy of a pulse that has gone through a minimum-phase channel is concentrated 
in the initial taps. As used herein, taps one through U are referred to as the initial taps, and taps 
U+1 through L are referred to as the tail taps, where U is a prescribed number. In one 
implemenatation, the tap number, U, is selected to ensure that the initial taps contribute a 
predefined percentage of the overall signal energy. 

According to one aspect of the invention, the less significant tail taps (U+1 
through L) are processed with a lower complexity cancellation algorithm, such as a decision- 
feedback equalizer (DFE) technique, that cancels the tail taps using tentative decisions. 
Thereafter, only the more significant initial taps (1 through U) are processed with a reduced state 
sequence estimation (RSSE) technique. The DFE technique initially removes the intersymbol 
interference associated with the tail taps, then the RSSE technique is applied only to the more 
important tail taps. Thus, only taps one through U are processed using the RSSE technique, 
while taps U+1 through L are processed with a lower complexity decision-feedback equalizer 
(DFE). The present invention does not further reduce the number of states which are processed 
in the RSSE circuit, thus ensuring a good bit eiTor rate (BER) versus signal-to-noise ratio (SNR) 
performance for a well-chosen value of U. Meanwhile, the computational complexity and 
processing time of the decision-feedback computations in the RSSE circuit are substantially 
reduced. The hardware complexity of the survivor memory unit (SMU) in the RSSE circuit can 
also be reduced. 

A receiver is disclosed that includes a tentative decision/tail processing circuit for 
processing the less significant tail taps and an RSSE circuit for processing the initial taps. The 
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tentative decision/tail processing circuit processes the less significant tail taps with a lower 
complexity DFE algorithm, to cancel the tail taps using tentative decisions. The RSSE circuit 
processes only the initial taps with the RSSE technique. 

5 Brief Descrintion of fhi. r>B-q^inp^ 

VIG./Ks^ schematic block diagram of a conventional receiver; 

FIG./is^a schematic block diagram of a receiver in accordance with the present 
invention; / 

FIg/s illustrates the signal energy of a pulse that has undergone dispersion 
10 through a minimum-phase channel; 

FIG./4 ijju^strates an implementation of the tentative decision/tail processing 
^ circuitry of FIG. 2; and 

y / 

g FIG^ illustrates an implementation of the reduced state sequence estimation 

py (RSSE) circuitry ofTIG. 2. 

H Detailcfl npsp^ppfinp 



FIG. 1 shows the block diagram for a conventional receiver 100 in a channel 
enviromnent associated with, for example, the Gigabit Ethernet 1000 Base-T standard. A major 
^ challenge for such receivers 100 is to jointly equalize the channel and decode the corrupted 
m treUis-coded signals at the high clock rates of the Gigabit Ethernet 1000 Base-T standard. As 
shown in FIG. 1, the receiver 100 includes an analog-to-digital {AID) converter 100 for 
converting the received analog signal to a digital signal. The digitized data is then processed by 
a feed forward equalizer (FFE) 120, an echo canceller 130 and a crosstalk canceller 140. 
Generally, the feed forward equalizer (FFE) 120 makes the chamiel impulse response causal and 
minimum-phase, and additionally whitens the noise, hi addition, the echo canceller 130 removes 
echo from the received signal and the crosstalk canceller 140 removes the crosstalk, in a known 
mamier. The equalizer/decoder 150 performs data detection, for example, using maximum 
likehhood sequence estimation (MLSE), to produce the output symbols or bits. 
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FIG. 2 illustrates a receiver 200 in accordance with the present invention that 
reduces the hardware complexity of reduced state sequence estimation (RSSE) algorithms for a 
given number of states, while also relaxing the critical path problem. A communications channel 
is represented using a discrete time model, where the channel impulse response has a length, L, 

denoted by {/t}^^o» where is the coefficient for channel tap k. In minimum-phase channels, 
the signal energy of a pulse that has undergone channel dispersion is concentrated in the initial 
taps. As shown in FIG. 3, the initial taps provide the largest contribution to the signal energy of 
the channel output, and the corresponding power decreases to zero as the taps approach infinity. 
As used herein, taps one through U are referred to as the initial taps, and taps U+1 through L are 
referred to as the tail taps, where U is a prescribed number. For example, the tap number, U, can 
be established using simulations or experimental results to ensure that the initial taps contribute a 
predefined percentage of the overall signal energy. 

According to a feature of the present invention, the less significant tail taps are 
processed with a lower complexity cancellation algorithm, such as a decision-feedback equalizer 
(DFE) technique, that cancels the tail taps using tentative decisions. Thereafter, only the initial 
taps are processed with a reduced state sequence estimation (RSSE) technique. Thus, the DFE 
technique initially removes the intersymbol interference associated with the tail taps, then the 
RSSE technique is applied only to the more important tail taps. Thus, for a channel having a 
memory, L, taps one through U are processed using the RSSE technique and taps U+l through L 
are processed with a lower complexity decision- feedback equalizer (DFE). 

FIG. 2 is a schematic block diagram of a receiver 200 in accordance with the 
present invention. The receiver 200 includes a sheer 210 that slices the digital data into symbol 
values. In addition, the receiver 200 includes tentative decision/tail processing circuitry 400, 
discussed further below in conjunction with FIG. 4, for processing the less significant tail taps 
with a lower complexity cancellation algorithm, such as a decision-feedback equalizer (DFE) 
technique, to cancel the tail taps using tentative decisions. The receiver 200 also includes RSSE 
circuitry 500, discussed further below in conjunction with FIG. 5, for processing only the initial 
taps with a reduced state sequence estimation (RSSE) technique (FIG. 5). 
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FIG. 4 illustrates a decision-feedback equalizer (DFE) implementation of the 
tentative decision/tail processing circuitry 400 of FIG. 2. As shown in FIG. 4, a feedback filter 
(FBF2) 410 takes tentative decisions obtained from a DFE structure (FBF) 400 and removes the 
less significant intersymbol interference (ISI) introduced by the tail channel taps {fk}k=u+\ ^ where 
5 K<U <L. The remaining severe intersymbol interference (ISI) introduced by only the initial 
channel taps [fkU^^ then treated in the RSSE circuitry 500. 

When t/ = L , the output of the feedback filter (FBF2) 410 does not affect the input 
to the RSSE circuitry 500 such that the structure operates like a full reduced state sequence 
estimator (RSSE). If K is the number of taps that are accoimted for in the combined code and 
10 channel state inside the RSSE circuitry 500, then choosing U = K leads to a structure, where 
feedback filter (FBF2) 410 cancels all intersymbol interference which is not accounted for in the 
I combined code and channel state. It is again noted that the design parameter U can be chosen to 
i trade-off performance and hardware complexity. 

The lower the value for the threshold, C/ , the less complex the decision feedback 



^^15 unit (DFU) in the RSSE circuitry 500, discussed below, as less intersymbol interference taps are 
accounted for in each decision feedback cell (DFC). However, this comes at the expense of a 



signal-to-noise ratio (SNR) penalty due to error propagation effects in the feedback filter (FBF2) 
g 410. Low values for U also relax the critical path problem in the feedback loop inside the RSSE 
=p circuitry 500 as the number of terms which have to be added in the decision feedback cell (DFC) 
^ is proportional to U . The critical path inside the RSSE circuit, consisting of the decision- 
feedback cell (DFC), branch metric cell (BMC), add-compare-select cell (ACSC) and survivor 
memory cell (SMC), as shown in FIG. 5, is the bottleneck for high speed implementations of the 
RSSE technique. The tentative decision/tail processing circuitry 400, according to FIG. 2, is not 
part of the critical path. 

25 In many practical situations, where the channel is minimum-phase, which can be 

accomplished with a feed forward equalizer (FFE), a low value of U is sufficient to achieve 
approximately the same bit error rate (BER) versus signal-to-noise ratio (SNR) performance as a 
conventional RSSE circuit, which cancels the intersymbol interference introduced by all L 
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channel taps. However, the present invention reduces the computational complexity of the 
decision-feedback unit (DFU) L/U times. In addition, computational delay through the decision 
feedback cell (DFC) (FIG. 5) is reduced L/U times so that the critical path problem is relaxed 
significantly as well. The present invention also allows for a survivor depth D of the survivor 
memory unit (SMU), which is smaller than L. In a conventional RSSE circuit, the survivor depth 
D must be at least L, as the L past survivor symbols are needed for the computations in the 
decision feedback imit (DFU). Thus, the present invention also allows for a hardware reduction 
of the survivor memory unit (SMU) (FIG. 5). 

In the case of decision-feedback sequence estimation (DFSE), which is a 
specialization of the RSSE technique, and C/ = /: , the decision feedback unit (DFU) and thus the 
feedback loop is removed and the decision-feedback sequence estimation (DFSE) becomes a 
pure Viterbi decoder. Thus, for this special case of decision-feedback sequence estimation 
(DFSE) with U equal to K, the present invention permits pipelining in all processing blocks 
outside the add-compare-select unit (ACSU), and the critical path reduces to one add-compare- 
select cell (ACSC). 

While the tentative decision/tail processing circuitry 400 has been implemented in 
FIG. 4 using a decision- feedback equalizer (DFE), the tentative decision/tail processing circuitry 
400 could likewise be implemented using a soft DFE approach. For a discussion of soft DFE 
techniques, see, for example, S.L. Ariyavisitakul and Y. Li, "Joint Coding and Decision 
Feedback Equalization for Broadband Wireless Channels**, IEEE Journal on selected Areas in 
Communications, vol. 16, no. 9, Dec. 1998, incorporated by reference herein. 

As previously indicated, FIG. 5 illustrates the reduced state sequence estimation 
(RSSE) circuit of FIG. 2 which processes only the initial taps of the channel impulse response. 
As previously indicated, reduced state sequence estimation (RSSE) techniques reduce the 
complexity of the maximum likelihood sequence estimators (MLSE) by merging multiple states 
of the fiiU combined channel/code trellis. For a more detailed discussion of conventional 
reduced state sequence estimation (RSSE) techniques that process all taps of the channel impulse 
response, see, for example, P. R. Chevillat and E. Eleftheriou, "Decoding of Trellis-Encoded 
Signals in the Presence of Intersymbol Interference and Noise", IEEE Trans. Conraiim., vol. 37, 
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669-76, (July 1989) and M. V. Eyuboglu and S. U. H. Qureshi, "Reduced-State Sequence 
Estimation For Coded Modulation On Intersymbol Interference Channels", IEEE JSAC, vol. 7, 
989-95 (Aug. 1989), each incorporated by reference above. 

Reduced state sequence estimation (RSSE) considers only partial information 
about the information symbol for the reduced combined trellis. The resulting reduced combined 

state is expressed as ={^n-K'^^n-K^-^K-\) where X^l^ contains the m,- bits of the information 
symbol X^.i which are considered for the reduced trellis. It is required that 
/w'<mj^ </wj^._i <...< <m , where m* is the number of information bits which are sent into the 

convolutional encoder of the TCM encoder. The reduced trellis has 5*= 5x2'">'^" '^'"^ states. 

FIG, 5 shows a block diagram for reduced state sequence estimation (RSSE) 
which is also valid for its specializations, decision-feedback sequence estimation (DFSE) and 
PDFE. 5' soft output values are computed by the 5' decision-feedback cells (DFC) in the 
decision-feedback imit (DFU) based on the survivors in the survivor memory unit (SMU) and fed 
into the branch metric unit (BMU), in which each branch metric cell (BMC) computes the 
metrics for the 6 = 2'"' transitions emanating from the corresponding state. Each decision- 
feedback cell (DFC) takes L past symbols from the corresponding survivor memory cell (SMC). 

Decision-feedback sequence estimation (DFSE) is a specialization of reduced 
state sequence estimation (RSSE) and employs a trellis that takes into account only the first K of 
the L chaimel coefficients {/^}, 0<K<L, The combination of the code state and truncated 

channel state defines the reduced combined state ^„ = {^n-Ky^n-K^-^^n-\)^ where a„ is the code 
state at time n, and X^_j^,.,.,X^_^ are the k previously sent information symbols. Intersymbol 
interference terms not represented in the combined state are estimated and subtracted in the 
metric computation using the path history of each state. A special case arises when a: = 0 , where 
the reduced trellis becomes the TCM code trellis and decision-feedback equalization is 
performed for each code state based on the survivor history of that path. This is called parallel 
decision-feedback equalization (PDFE). Decision-feedback sequence estimation (DFSE) follows 
from reduced state sequence estimation (RSSE) with mi=... = mj^ =m. 
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In an alternate implementation, the RSSE circuit 500 may be replaced by an M- 
algorithm (MA). The M-algorithm (MA) techniques work on the complete combined trellis, but 
retain at each processing step only M paths with the best metrics. For a discussion of M- 
algorithms (MA), see, for example, N. Seshadri and J. B. Anderson, "Decoding of Severely 
Filtered Modulation Codes Using the (M,L) Algorithm", IEEE JSAC, vol. 7, 1006-1016 (Aug. 
1989), incorporated by reference herein. 

Generally, the M-algorithm (MA) uses M DFEs to cancel the intersymbol 
interference for the M best paths. At each processing cycle, each of these M paths is extended 
by its 6 = 2'"' extensions, then the resulting bM paths are tested for duplicate paths and sorted to 
find the M best paths. The M-algorithm (MA) is not as inherently parallel as the reduced state 
sequence estimation (RSSE) implementation shown in FIG. 5 or the Viterbi algorithm itself, as 
the testing and sorting operation in the add-test-sort unit (ATSoU) is performed over all bM path 
extensions. Each decision-feedback cell (DFC) takes L past symbols fi"om the corresponding 
survivor memory cell (SMC). 

In the altemate embodiment, the M-algorithm (MA) would process the combined 
trellis that results fi-om the concatenation of the TCM code and a channel with the first U taps of 
the channel impulse response. The tail of the channel impulse response would be processed with 
a lower complexity cancellation algorithm as discussed above. 

It is to be understood that the embodiments and variations shown and described 
herein are merely illustrative of the principles of this invention and that various modifications 
may be implemented by those skilled in the art without departing fi-om the scope and spirit of the 
invention. 



